MENLI: Robust Evaluation Metrics from Natural Language Inference

Abstract

Recently proposed BERT-based evaluation metrics for text generation perform well on standard benchmarks but are vulnerable to adversarial attacks, e.g., relating to information correctness. We argue that this stems (in part) from the fact that they are models of semantic similarity. In contrast, we develop evaluation metrics based on Natural Language Inference (NLI), which we deem a more appropriate modeling. We design a preference-based adversarial attack framework and show that our NLI-based metrics are much more robust to the attacks than the recent BERT-based metrics. On standard benchmarks, our NLI-based metrics outperform existing summarization metrics, but perform below SOTA MT metrics. However, when combining existing metrics with our NLI metrics, we obtain both higher adversarial robustness (15%–30%) and higher quality metrics as measured on standard benchmarks (+5% to 30%).
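The combination described in the abstract can be illustrated with a minimal sketch. It assumes a similarity-based metric score already scaled to [0, 1] and entailment/contradiction probabilities from some NLI classifier. The function names, the entailment-minus-contradiction formula, and the default weight of 0.5 are illustrative assumptions, not details stated in this abstract.

```python
def nli_score(p_entail: float, p_contra: float) -> float:
    """Map NLI probabilities to [0, 1] (assumed formula, for illustration).

    High when the candidate is entailed by the reference,
    low when it contradicts the reference.
    """
    return (p_entail - p_contra + 1.0) / 2.0


def combine(base_score: float, p_entail: float, p_contra: float,
            weight: float = 0.5) -> float:
    """Weighted average of an NLI-derived score and a similarity-based score.

    `base_score` is a hypothetical existing metric value in [0, 1];
    `weight` is the share given to the NLI score.
    """
    if not 0.0 <= weight <= 1.0:
        raise ValueError("weight must be in [0, 1]")
    return weight * nli_score(p_entail, p_contra) + (1.0 - weight) * base_score


# Made-up numbers: a faithful paraphrase vs. an adversarial candidate that
# stays lexically close to the reference but changes a fact.
paraphrase = combine(base_score=0.90, p_entail=0.95, p_contra=0.01)
adversarial = combine(base_score=0.92, p_entail=0.03, p_contra=0.95)
print(paraphrase > adversarial)
```

In this toy case the similarity score alone slightly prefers the adversarial candidate (0.92 vs. 0.90), while the combined score prefers the paraphrase, which is the kind of preference an attack framework like the one described would check.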

Similar resources

Natural Language Inference from Multiple Premises

We define a novel textual entailment task that requires inference over multiple premise sentences. We present a new dataset for this task that minimizes trivial lexical inferences, emphasizes knowledge of everyday events, and presents a more challenging setting for textual entailment. We evaluate several strong neural baselines and analyze how the multiple premise task differs from standard tex...

Natural language directed inference from ontologies

This paper presents an investigation into the problem of content determination in natural language generation (NLG), using as an example the problem of determining what to say when asked “What is an A?”, where A is a concept defined in an OWL ontology. It is shown that a naive approach to this problem, which just presents a set of the stated axioms, will often inadvertently violate maxims of co...

Robust Natural Language Analysis

Our basic goal is the development of more robust systems for extracting information from natural language text. A robust system is one which is able to extract at least partial information despite the presence of ill-formed or unexpected syntactic, semantic, or discourse structures. Our approach has two aspects: First, we incorporate a rich set of syntactic, semantic, and discourse constraints,...

Natural logic and natural language inference

We propose a model of natural language inference which identifies valid inferences by their lexical and syntactic features, without full semantic interpretation. We extend past work in natural logic, which has focused on semantic containment and monotonicity, by incorporating both semantic exclusion and implicativity. Our model decomposes an inference problem into a sequence of atomic edits lin...

Natural Language Inference in Coq

In this paper we propose a way to deal with natural language inference (NLI) by implementing Modern Type Theoretical Semantics in the proof assistant Coq. The paper is a first attempt to deal with NLI and natural language reasoning in general by using the proof assistant technology. Valid NLIs are treated as theorems and as such the adequacy of our account is tested by trying to prove them. We use...

Journal

Journal title: Transactions of the Association for Computational Linguistics

Year: 2023

ISSN: 2307-387X

DOI: https://doi.org/10.1162/tacl_a_00576